Data storage system and method for decoding data based on extrapolated flipped-bit data

ABSTRACT

An error management system for a data storage device can generate soft-decision log-likelihood ratios (LLRs) using multiple reads of memory locations. 0-to-1 and 1-to-0 bit flip count data provided by multiple reads of reference memory locations can be used to generate probability data that is used to generate possible LLR values for decoding target pages. Possible LLR values are stored in one or more look-up tables.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation application of application Ser. No. 14/584,326, filed on Dec. 29, 2014, now U.S. Pat. No. 10,545,879, which is a continuation application of application Ser. No. 13/797,923, filed on Mar. 12, 2013, now U.S. Pat. No. 8,924,824, the disclosure of each of which is hereby incorporated by reference herein in its entirety.

BACKGROUND Technical Field

This disclosure relates to data storage systems. More particularly, the disclosure relates to systems and methods for generating log-likelihood ratios for data storage systems.

Description of the Related Art

Soft-decision low-density parity-check code (LDPC) error code correction (ECC) can improve the reliability of a data storage system and reduce the number of data errors. Log-likelihood ratios (LLRs) are commonly used as the inputs for soft-decision LDPC engines. Data storage systems that use flash memories (e.g., NAND) as data storage media can use LLR calculations for reading memory cells when LDPC based on single-read input is insufficient to decode the originally-stored data.

BRIEF DESCRIPTION OF THE DRAWINGS

Various embodiments are depicted in the accompanying drawings for illustrative purposes, and should in no way be interpreted as limiting the scope of the inventions. In addition, various features of different disclosed embodiments can be combined to form additional embodiments, which are part of this disclosure. Throughout the drawings, reference numbers may be reused to indicate correspondence between reference elements.

FIG. 1A is a block diagram illustrating an embodiment of a combination of a host system with storage device including an error management module.

FIG. 1B is a block diagram illustrating an embodiment of an error management module.

FIG. 2 is a graph showing a probability distribution of cells in a non-volatile solid-state memory array according to one embodiment.

FIG. 3 is a flow diagram for an embodiment of a 1-read soft-decision input generation process.

FIG. 4 is a graph showing a probability distribution of cells in a non-volatile solid-state memory array according to one embodiment.

FIG. 5A is a flow diagram illustrating an embodiment of a process of generating soft-decision LLR inputs using multiple reads.

FIG. 5B is a flow diagram illustrating an embodiment of a process of generating soft-decision LLR inputs using multiple reads.

FIG. 6 is a flow chart illustrating an embodiment of a process for decoding a reference bit stream using hard-decision input.

FIG. 7 provides an embodiment of a cell probability distribution graph.

FIG. 8 provides an embodiment of a cell probability distribution graph for a multiple-read embodiment.

FIG. 9 is a graph illustrating an embodiment of a cumulative distribution function.

FIG. 10 illustrates an embodiment of a voltage distribution graph showing the separation of lower page data into different upper page distributions.

DETAILED DESCRIPTION

While certain embodiments are described, these embodiments are presented by way of example only, and are not intended to limit the scope of protection. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms. Furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the scope of protection.

Overview

Data storage cells in solid state drives (SSD), such as multi-level-per-cell (MLC) flash memory, may have distinct threshold voltage distribution (Vt) levels, corresponding to different memory states. For example, in an MLC implementation, different memory states in an SSD may correspond to a distribution of voltage levels ranging between voltage read levels; when the charge of a memory cell falls within a particular range, one or more reads of the page may reveal the corresponding memory state of the cell. The term “read” is used herein with respect to voltage reads of SSDs according to its broad and ordinary meaning, and may refer to read operations on a page, including a plurality of cells (e.g., thousands of cells), or may be used with respect to a voltage charge level of a single memory cell.

Voltage read levels can advantageously be set to values in the margins between memory states. According to their charge level, memory cells store different binary data representing user data. For example, based on its charge level, each cell generally falls into one of the memory states, represented by associated data bits. Performing cell reads at the various read levels can provide 1-bit input data for identifying the memory states with which certain cells are associated when the distributions for different states are tightly clustered and there is no overlap between them.

Over time, and as a result of various physical conditions and wear from repeated program/erase (P/E) cycles, the margins between the various distribution levels may be reduced, so that voltage distributions overlap to some extent. Such reduction in read margin may be due to a number of factors, such as loss of charge due to flash cell oxide degradation, over-programming caused by erratic program steps, programming of adjacent erased cells due to heavy reads or writes in the locality of the cell (or write disturbs), and/or other factors. When voltage distributions overlap, single-read inputs may not provide enough information to decode the original data.

Soft-decision inputs, such as log-likelihood ratios (LLRs), can enhance the probability of successful decoding in certain situations. However, in certain embodiments, SSDs are not configured to provide soft-decision data. For example, an SSD may present a binary interface, wherein the SSD returns binary data based on reads of the SSD at various voltage threshold levels. Such voltage threshold levels utilized by the SSD may be adjustable or configurable in certain configurations. In order to provide more reliable data input, it may be desirable for an SSD controller to generate soft-decision data based on binary return values provided by SSDs. Certain embodiments disclosed herein provide for soft-decision input generation for a decoding module, such as a low-density parity check (LDPC) decoder, using a multiple-read technique. Certain embodiments further provide systems and methods for SSD channel estimation, as well as optimal Vt tracking for SSDs to improve utilization of the decoder correction capability.

“Page,” or “E-page,” as used herein may refer to the unit of data correction of embodiments disclosed herein. For example, error correction/calibration operations may be performed on a page-by-page basis. A page of data may be any suitable size. For example, a page may comprise 1 k, 2 k, 4 k, or more bytes of data. Furthermore, the term “location,” or “memory location” is used herein according to its broad and ordinary meaning and may refer to any suitable partition of memory cells within one or more data storage devices. A memory location may comprise a contiguous array of memory cells or addresses (e.g., a page).

As used in this application, “non-volatile solid-state memory” may refer to solid-state memory such as NAND flash. However, the systems and methods of this disclosure may also be useful in more conventional hard drives and hybrid drives including both solid-state and hard drive components. Solid-state memory may comprise a wide variety of technologies, such as flash integrated circuits, Phase Change Memory (PC-RAM or PRAM), Programmable Metallization Cell RAM (PMC-RAM or PMCm), Ovonic Unified Memory (OUM), Resistance RAM (RRAM), NAND memory, NOR memory, EEPROM, Ferroelectric Memory (FeRAM), MRAM, or other discrete NVM (non-volatile solid-state memory) chips. The non-volatile solid-state memory arrays or storage devices may be physically divided into planes, blocks, pages, and sectors, as is known in the art. Other forms of storage (e.g., battery backed-up volatile DRAM or SRAM devices, magnetic disk drives, etc.) may additionally or alternatively be used.

System Overview

FIG. 1A is a block diagram illustrating an embodiment of a combination of a host system 110 with storage device 120 incorporating error management functionality. As shown, the storage device 120 includes a controller 130, which in turn includes an error management module 140A. In certain embodiments, the error management module 140A is configured to detect and correct certain kinds of internal data corruption of one or more non-volatile solid-state memory arrays 150. In one embodiment, the error management module 140A may include an encoder (e.g., an LDPC encoder) to encode data written to the memory array 150 and a decoder (e.g., an LDPC decoder) to decode data read from the memory array 150. In one embodiment, the error management module 140A is configured to generate LLRs for cells of the memory array 150 for soft-decision error correction.

In certain embodiments, the controller 130 is configured to receive memory access commands from a storage interface (e.g., driver) 112 residing on a host system 110. The controller 130 is configured to execute commands in response to such host-issued memory commands in the non-volatile solid-state memory arrays 150. Data may be accessed/transferred based on such commands. In one embodiment, the storage device 120 may be a hybrid disk drive that additionally includes magnetic memory storage (not shown). In such case one or more controllers 130 would control the magnetic memory storage and the non-volatile solid-state memory array(s) 150.

FIG. 1B is a block diagram illustrating an embodiment of an error management module 140B, which may correspond to the error management module 140A shown in FIG. 1A. In certain embodiments, the error management module 140B includes a soft-decision input generator 142 for generating soft decision input that may be used by a decoder module 149. The soft-decision input generator 142 may be configured to quantize binary inputs to multiple-bit values, which may be used to provide multiple-bit input to the decoder 149. Therefore, the soft-decision input generator 142 may be configured to emulate soft-decision input using binary values. In certain embodiments, soft-decision decoding provides improved performance with respect to comparable hard decision decoding techniques.

The error management module 140B includes a calibration module 144 as well as one or more indexes for storing look-up tables (LUTs) containing possible soft-decision input values. The error management module 140B may further include a module (not shown) for tracking voltage threshold levels for determining relative optimal voltage threshold levels. For example, the voltage threshold tracking module may report one or more E-page reads having minimum bit-flip error counts from among a plurality of E-page reads associated with different voltage read levels. For example, a page A may be read three times at different voltage read levels, resulting in three read copies A1, A2, and A3 and the tracking module may find that A1 results in the fewest bit-flip error as compared the decoded page A.

The error management module 140B may be configured to perform soft-decision calibration using the calibration module 144, wherein possible LLR values are stored by the calibration module 144 in one or more local and/or remotely-stored LUTs, which can be used by the error management module 140B for decoding target data as needed. When a target page cannot be decoded using hard-decision input, the appropriate LUT may be provided to the soft-decision input generator with the target data. Calibration may be performed periodically using a background process, and may be performed on a per-block, or per-die basis. For example, a calibration can be performed based on a reference page within each of blocks B1, B2, BN. If a read on a page (the target page) in block B3 encounters an error, then the soft-decision data based on calibration on the reference page within B3 can be used to help decode that target page in block B3. For example, calibration may performed according to a firmware-based schedule, which may dictate calibration on a daily, weekly, monthly, or other basis.

The calibration module 144 is configured to generate possible LLR values based on one or more reads of a reference memory location and comparison of such read values with known reference values stored in the reference memory location. The calibration module 144 may further populate one or more LUTs 148 with the generated LLR values. The LLR values stored in the LUT(s) may be used by the error management module 140B as inputs to the decoder 149 in association with the decoding of target memory locations related to the reference location. In addition to the index 148, one or more indexes/LUTs may be stored externally to the error management module 140B. For example, multiple LUTs, each associated with different reference locations may be stored externally. When the error management is tasked with decoding a particular target location, the LUT associated with the corresponding reference location may be accessed and used by the error management module 140B to provide soft-input values to the decoder 149. LUTs may be any suitable size. For example, in certain embodiments, LUTs have dimensions of n×m, wherein n may correspond to the maximum number of multi-reads supported+1 and m may correspond to the bit width of the LLR, which in one embodiment is equal to the decoder's internal precision. In one embodiment, the dimension used is 8×8.

The calibration module 144 may include functionality for performing certain statistical analysis for 1-read and/or multiple-read decoding operations. For example, such statistical analysis may include determining one or more of the following: total bit-error count, 1-to-0 flip count, and 0-to-1 flip count, wherein total bit-error count is the sum of 1-to-0 and 0-to-1 flip counts. The calibration module 144 may make such determinations based at least in part on LLR input values stored in the index 148, as well as information from the decoder 149. In certain embodiments, the calibration module 144 determines which among a plurality of voltage level reads of a location has the minimum total bit-error count. The corresponding voltage read level may be considered the optimal read level for the location.

FIG. 2 is a graph showing a probability distribution of cells in a non-volatile solid-state memory array according to one embodiment. Flash memory, such as multi-level cell (MLC) NAND flash memory, may store two or more bits of information per cell. While certain embodiments disclosed herein are described in the context of MLCs, it should be understood that the concepts disclosed herein may be compatible with single level cell (SLC), three-level cell (TLC) technology (a type of MLC NAND), and/or other types of technology. Data is generally stored in MLC NAND flash memory in binary format. For example, two-bit-per-cell memory cells can have 4 distinct voltage (Vt) levels, and 3-bit-per-cell memory cells can have 8 distinct Vt levels, and so on. According to their Vt, and the coding associated with their Vt, memory cells store different binary bits.

The horizontal axis depicted in FIG. 2 represents cell voltage level. The vertical axis represents the number of cells that have the corresponding voltage values. Thus, the four distribution curves represent the number of cells, broken down by the four distributions, which have the corresponding voltage values. As shown, the voltage distribution of the memory cells may include a plurality of distinct levels, or states (e.g., States 0-3 in this example 2-bit-per cell MLC configuration, as shown). Read reference values (i.e., voltage threshold levels R1-R3) may be placed between these levels. The gap between the levels (i.e., margin between programmed states), in which the read voltage references may advantageously be positioned in certain embodiments, is referred to as “read margin.”

Over time, and as a result of various physical conditions and wear, for example from being subjected to repeated P/E cycles, the read margins between the various distribution levels may be reduced, resulting in both data retention problems and higher read errors beyond certain limits. Such reduction in read margin may be due to a number of factors, such as loss of charge due to flash cell oxide degradation, over-programming caused by erratic program steps, programming of adjacent erased cells due to heavy reads or writes in the locality of the cell (or write disturbs), and/or other factors.

While the diagram of FIG. 2 illustrates a Vt distribution for 2-bit-per-cell flash memories, embodiments and features disclosed herein may be applicable to other types of coding schemes. With respect to the embodiment of FIG. 2, the coding for States 0-3 can be, for example, ‘11,’ ‘01,’ ‘00,’ and ‘10,’ or any other coding. Each cell may generally fall into one of the illustrated states and correspondingly represents two bits. For one word line (WL), which can be connected to tens of thousands of cells in a NAND array, the lower digit of the cells may be referred to as the “lower page,” and the upper digit may be referred to as the “upper page.” For 3-bit-per-cell flash memories, there may also be intermediate digits, which may be referred to as “middle pages.” Reading voltage levels and operations are dependent on the coding of these states. For example, for the coding as shown in FIG. 2 for the 2-bit-per-cell flash memories, one read at R2 may be required to read out the lower page, and two reads at both R1 and R3 may be required to read out the upper page. As shown in the distribution of FIG. 2, these reading voltages may be selected between state distributions in the case where the distributions for different states are narrow so that there is no overlap between them.

Soft Decision Input Generation—Scaled Approach

With further reference to FIG. 1B, the soft-decision input generator 142 may be configured to generate soft-decision inputs based on a single read of a memory location, wherein multiple-bits of information are generated based on a single binary value. In such a configuration, binary values returned by memory arrays may be up-scaled in order to generate the appropriate number of bits. For example, binary values of 0 and 1 may be up-scaled by a scale factor corresponding to the internal bit-width of the decoder.

FIG. 3 is a flow diagram for an embodiment of a 1-read soft-decision input generation process 300. The process 300 includes reading a target page at a predetermined voltage threshold level to obtain a 1-bit binary data value at block 302. In certain embodiments, the 1-bit values are mapped to the values +1 and −1 at block 304. For example, a binary value of ‘0’ may be mapped to −1, while the binary value of ‘1’ may be mapped to +1. The values are then scaled up by a certain scale factor in order to generate the desirable number of bits of data for use by the relevant decoder, such as an LDPC decoder. By such procedure, each 1-bit data value may be converted to an n-bit value, where n is the internal bit-width of the decoder core.

Soft Decision Input Generation—Multi-Read Approach

Single-read soft-decision input generation may not provide sufficient or desirable information for certain purposes. Therefore, certain embodiments disclosed herein provide a soft-decision input generator that is configured to generate soft-decision inputs based on multiple reads of a memory location, thereby providing multiple bits of data from which to generate soft-decision inputs. FIG. 4 is a graph showing a probability distribution of cells in a non-volatile solid-state memory array illustrating a multiple-read scheme. As discussed above, due to the memory wearing out, loss of data retention, and the like, the states of a voltage distribution can widen and overlap. Reading at a single read voltage (e.g., R1) may not be enough to decode the original data, as one or more of the probability distributions cross over the read level. Therefore, utilizing certain ECC schemes, such as, single-read LDPC, may be insufficient to correctly decode the data. Although multiple-read schemes may be computationally complex and time-consuming, multiple reads can provide increased performance in certain situations with respect to single-read solutions.

In the multiple-read scheme illustrated in FIG. 4, three reads are involved, with reading voltages at R0, R1, and R2. These three reading voltages divide the distribution shown into four zones (e.g., zones 1-4, from left to right). Although three reading voltages are illustrated in FIG. 4, certain embodiments may include more than three reading voltages, such that the distribution may be divided into more than four zones. For example, 4, 5, 6, 7 or more reads may be taken in association with a junction between memory states. Flash cells having charge levels in the different zones may return different values corresponding to the respective zone. For example, in certain embodiments, flash cells read having a charge level within zone 1 may return ‘1’ for each of the three reads, thereby providing three bits of information (‘111’); cells read having a charge level within zone 2 return ‘011’; cells read having a charge level within zone 3 return ‘001’; and cells read having a charge level within zone 4 return (‘000’). If the data are known, the LLRs for these 4 groups of number combination may be obtained. For example, if there are a total of N cells in zone 1, among which the real values of m cells are 0, LLRs can be determined using the following equation: LLR(111)=log(m/N−m). Although FIG. 4 shows three reads and four zones, more reads and zones are possible in other embodiments, and the LLRs may be generated in a similar manner to that described above.

FIG. 5A is a flow diagram illustrating an embodiment of a process 500A of generating soft-decision LLR inputs using multiple reads. The process 500A is described with reference to FIG. 1B for illustrative purposes, and may be performed by the error management module 140B of FIG. 1B. In one embodiment, the process begins at block 502 where correct reference data is determined. Then in block 504, multiple reads with different voltage thresholds are performed on the reference data. In block 506, probability data are obtained based on comparison of bit flip count data. In block 508, the obtained probability data are applied to decode target memory location(s). During decoding of a memory location, read errors may be encountered and soft-decision data may be used for decoding data stored in the memory location. Therefore, in certain embodiments, part of the process 500A (e.g., block 508) is performed after hard-decision error correction fails with respect to one or more target memory locations. Other parts of the process 500A may be performed periodically, apart from the occurrence of a failure to hard-decision error correction.

FIG. 5B is a flow diagram illustrating another embodiment of a process 500B of generating soft-decision LLR inputs using multiple reads. The process 500B includes determining a known reference bit stream recorded in a reference memory location at block 512 and performing a plurality reads of the reference location at a plurality of voltage read levels to obtain streams of bits corresponding to each of the reads (block 514). The process 500B may further include calculating 1-to-0 and 0-to-1 bit flip count data for each voltage read level based on the bit streams (block 516). Possible LLR values are generated at block 518 based on the bit flip count data. At block 520 a sequence of LLRs based at least in part on the possible LLRs and on bit patterns from a multiple read operation on the target memory location is generated and data from a target memory location is decoded using the generated sequence of LLRs.

Multi-Read Approach—Determine Correct Reference Data

The process 500A includes determining correct reference data at block 502, while the process 500B includes determining a known reference bit stream recorded in a reference memory location at block 512. Such reference data may provide a basis for calibration of the soft-decision input generator 142 of FIG. 1B. For example, in certain embodiments, calculated possible LLR values are based on a reference memory location, or known channel, wherein the LLR values are applicable to target memory locations sharing one or more data retention characteristics of the reference location. For example, the reference memory location may have experienced similar environmental conditions, age and/or read/write cycling as the target location(s). The reference location may be located in physical proximity to target locations that are decoded based on analysis of the reference location. In certain embodiments, target locations are located within the same block of data as the reference location. That the reference location has similar device usage history to target locations can be important so that the information collected from the reference location can at least partly reflect the retention variables experienced by the related locations closely enough such that the reference location can be considered a valid reference with respect to the related memory locations.

Determining the correct reference data may include determining a bit stream stored in the reference memory location for reference during calibration of the soft-decision input generator 142. Any suitable or desirable mechanism for determining the correct value of the reference bit stream may be used. For example, the reference memory location may be assigned by the system prior to calibration of the soft-decision input generator 142, and may be pre-programmed with a known bit stream.

Alternatively, in certain embodiments, the reference location does not contain a pre-programmed known bit stream. Instead, the reference bit stream may be determined by decoding the data in the reference location during calibration, as will be further explained below with respect to FIG. 6. In certain embodiments, the error management module 140B is configured to decode the reference memory location using hard-decision techniques. If a selected reference location is not able to be decoded correctly using hard-decision techniques, the calibration module 144 may be configured to select a different memory location to serve as the relevant reference location.

FIG. 6 is a flow chart illustrating an embodiment of a process 600 for decoding a reference bit stream using hard-decision input, which may be performed as an initial, or early, stage of calibration (e.g., as part of block 502 in FIG. 5A or block 512 in FIG. 5B). The process 600 includes reading the reference memory location/page using a default voltage read level. For example, in a 3-read scheme, the default read level may correspond to the intermediate read level. At block 604 it is determined whether the page can be decoded using the default read level to provide hard-decision input. If so, then the process 600 is complete, as the reference page can therefore be correctly decoded. If the page cannot be decoded using hard-decision input, the process 600 includes attempting to decode the page using higher and/or lower voltage read levels, wherein the process 600 terminates upon successful decoding. In certain embodiments, the process 600 loops until the page can be decoded successfully using hard-decision input. Upon completion of the process 600, further calibration steps may be performed, as discussed herein.

Multi-Read Approach—Perform Multi-Read on Reference Data

With further reference to FIG. 5A, once the reference bit stream stored in the reference memory location has been determined, the process 500A includes performing multiple reads on the reference data at block 504. Similarly, in FIG. 5B, in block 514 the process 500B performs a plurality reads of the reference location at a plurality of voltage read levels to obtain streams of bits corresponding to each of the reads. Such reads may be performed in order to generate a plurality of bit streams corresponding to values returned in connection with the plurality of reads.

Once the multiple reads are completed, the process 500A includes calculating LLR values based determining the bit flip counts of each of the multiple reads through comparisons of the multiple-read values to the correct reference data at block 506. For example, each of the reads of the reference location can be compared to the known reference data to determine how many of the cells are flipped from ‘0’ to ‘1,’ and vice versa. Table A shows example bit streams obtained through reads of a reference memory location at three different read voltage levels (R0, R1, R2). The top row of bits reflects the correct data stored in the reference memory location. The three voltage read levels may illustratively correspond to those shown in FIG. 4, wherein the three voltage read levels divide the relative distribution spectrum into four zones represented by 3-bit bit patterns.

TABLE A Reference E-Page Bit Stream Correct Data 0 1 0 0 1 0 R₀ 0 1 0 0 0 0 R₁ 0 1 1 1 1 0 R₂ 1 1 1 1 1 0

While the table shows 6-bit bit streams in each of the rows, such bit stream size is provided for illustration purposes only, and a given memory location of interest may be significantly larger, comprising thousands of bits or more, for example. Furthermore, more than three reads may be used, or two reads may be used. Calibrating the soft-decision input generator using the data of Table A may include determining the following bit flip counts with respect to the correct reference data: (1) the 0-to-1 bit flip count at R0; (2) the 1-to-0 bit flip count at R0; (3) the 0-to-1 bit flip count at R1; (4) the 1-to-0 bit flip count at R1; (5) the 0-to-1 bit flip count at R2; and (6) the 1-to-0 bit flip count at R2. With respect to the 6-bit example shown in Table A, the values of each of the six calculated data point would be: (1)=0, i.e., no bits flipped from ‘0’ to ‘1’ for R0; (2)=1 (bit five is flipped from ‘1’ to ‘0’ for R0); (3)=2 (bits three and four are flipped from ‘0’ to ‘1’ for R1); (4)=0; (5)=3 (bits one, three and four are flipped from ‘0’ to ‘1’ for R2); and (6)=0. In embodiments including more or fewer reads, the number of bit flip counts calculated may be different. Generally, the number of data points calculated is approximately equal to 2×M, where M is the number of reads at different voltage levels.

Multi-Read Approach—Calculate LLR Values

Certain embodiments disclosed herein provide for probability data (e.g., LLR) calculation based on calculated 1-to-0 and 0-to-1 bit flip counts. This may correspond to in block 506 in FIG. 5A in one embodiment, and to blocks 516 and 518 of in FIG. 5B in another embodiment. FIG. 7 provides a cell probability distribution graph that may be helpful in illustrating the application of bit flip counts to LLR calculation algorithms. In the distribution of FIG. 7, a ‘1’ distribution and a ‘0’ distribution overlap in a region of the voltage spectrum. In an example embodiment, a page is read at a voltage read level Vt, as shown. With respect to a read of the page at Vt, the area 605 of the ‘0’ distribution represents the number of originally-programmed ‘0’ cells that have flipped to ‘1,’ whereas the region 607 represents the number of originally-programmed ‘1’ cells that have flipped to ‘0.’ In certain embodiments, it is assumed that the occurrences of 0/1 in the reference bit stream is randomized such that the number of 0's and 1's is approximately equal. Therefore, the variable 0_px, which is used herein to refer to the probability that a ‘0’ in the reference bit stream has flipped to ‘1’ at voltage read level Vtx, as well as the probability that a ‘1’ in the reference bit stream has flipped to ‘0,’ can be represented as follows:

${0{\_ px}} = {\frac{0{\_ to}\_ 1({VTx})}{{total\_ correct}\_ 0{\_ bits}} \cong \frac{0{\_ to}\_ 1({VTx})}{0.5*{total\_ bits}}}$ ${1{\_ px}} = {\frac{1{\_ to}\_ 0({VTx})}{{total\_ correct}\_ 1{\_ bits}} \cong \frac{1{\_ to}\_ 0({VTx})}{0.5*{total\_ bits}}}$

With reference to FIG. 1B, the calibration module 144 may be configured to determine the 1-to-0 and 0-to-1 flip count in the decoder 149 after decoding procedure is done. As shown in FIG. 7, 1-to-0 count represents the integrated probability that ‘1’ is flipped to ‘0’ when the reference threshold is at Vt, while the 0-to-1 count represents the integrated probability that ‘0’ is flipped to ‘1’ when the reference threshold is at VT. Furthermore, 1-to-0 or 0-to-1 counts may represent the value in the cumulative distribution function (CDF).

The LLR (Log Likelihood Ratio) may be defined as

${\ln\left( \frac{P(0)}{P(1)} \right)},$ where P(0) is the probability that a transmitted bit is a ‘0’ value, while P(1) is the probability that the transmitted bit has a value of ‘1.’ It may therefore be convenient to use bit counts directly since the relevant probabilities are proportional to the count number. Based on Bayes' theorem, P(0)=Pr(ci=0|y0(i), y1(i), . . . ym−1(i))=Pr(y0(i), y1(i), . . . ym−1(i))|ci=0)*Pr(ci=0)/Pr(y0(i), y1(i), . . . ym−1(i))=K*Pr(y0(i), y1(i), . . . ym−1(i) ci=0).

With respect to the distribution graph of FIG. 7, for a cell associated with a returned value of ‘1’ from a voltage read at VT, the value of P(0) may be represented by the area 605 divided by the sum of the area of the ‘1’ distribution on the left side of VT (i.e., not including the area of the region 607) plus the area 605. Therefore, as the areas 605 and 607 are provided by calculating the 1-to-0 and 0-to-1 bit flip counts, such counts can be used to calculate LLR values according to the LLR equation in the preceding paragraph.

In a multiple-read scheme, the LLR calculations may be somewhat more complex. FIG. 8 provides a cell probability distribution graph for a multiple-read embodiment. As defined above, the bit flip counts of the form Y_px shown in the graph are defined to represent the probability that a reference bit having an original value of ‘Y’ has flipped when read at a voltage read level VTx. Table B provides a mathematical representation of the values of each of the variables identified in the graph:

TABLE B LLR Calibration Algorithm Based on Probability Density Function 1_p0 = ∫_(+∞) ^(V) _(T0) p₁ (x)dx 0_p0 = ∫_(−∞) ^(V) _(T0) p₀ (x)dx 1_p1 = ∫_(+∞) ^(V) _(T1) p₁ (x)dx 0_p1 = ∫_(−∞) ^(V) _(T1) p₀ (x)dx 1_p2 = ∫_(+∞) ^(V) _(T2) p₁ (x)dx 0_p2 = ∫_(−∞) ^(V) _(T2) p₀ (x)dx 1_p3 = ∫_(+∞) ^(V) _(T3) p₁ (x)dx 0_p3 = ∫_(−∞) ^(V) _(T3) p₀ (x)dx 1_p4 = ∫_(+∞) ^(V) _(T4) p₁ (x)dx 0_p4 = ∫_(−∞) ^(V) _(T4) p₀ (x)dx 1_p5 = ∫_(+∞) ^(V) _(T5) p₁ (x)dx 0_p5 = ∫_(−∞) ^(V) _(T5) p₀ (x)dx 1_p6 = ∫_(+∞) ^(V) _(T6) p₁ (x)dx 0_p6 = ∫_(−∞) ^(V) _(T6) p₀ (x)dx

Where p1(x) and p0(x) are possibility density functions. Furthermore, it holds true that: 1=∫_(+∞) ^(V) ^(−∞) p ₁(x)dx And 1=∫_(−∞) ^(V) ^(+∞) p ₀(x)dx

Although FIG. 8 illustrates a 7-read scheme, the principles described herein may be applicable to any suitable number of reads. With respect to the 7-read embodiment illustrated, the LLR equations for each of the zones shown and represented by 7-bit values may be as follows: LLR(0000000)=C*log2((1−0_p6)/1_p6); LLR(0000001)=C*log2((0_p6−0_p5)/(1_p5−1_p6)); LLR(0000011)=C*log2((0_p5−0_p4)/(1_p4−1_p5)); LLR(0000111)=C*log2((0_p4−0_p3)/(1_p3−1_p4)); LLR(0001111)=C*log2((0_p3−0_p2)/(1_p2−1_p3)); LLR(0011111)=C*log2((0_p2−0_p1)/(1_p1−1_p2)); LLR(0111111)=C*log2((0_p1−0_p0)/(1_p0−1_p1)); LLR(1111111)=C*log2((0_p0)/(1−1_p0));

C is a constant, which may be used to convert the log2(x) to 1 n(x) and/or other scaling purposes. These values may serve as the content of the associated LLR LUT(s). The thresholds of VT are arranged from low to high. If the VT order is changed, the order of bit value representations (e.g., 0000000, 0000001, 0000011, etc.) may have to be changed as well.

These calculated possible LLR values may be used to populate one or more LUTs, as described above. Furthermore, the system may be configured to dynamically update LUT entries based on ongoing calibration/decoding operations. In certain MLC embodiments, memory cells comprise upper and lower pages of data, wherein separate LUTs are generated for upper and lower page values. When reading upper page values, the controller 130 (FIG. 1A) may reference lower page data in order to distinguish between read levels R1 and R3, as illustrated in FIG. 2.

Multi-Read Approach—Apply LLR Values to Target Memory Location(s)

With further reference to FIG. 5A, at block 508 of the process 500A, the recorded possible LLR values are applied to decode additional target memory locations that are determined to have retention characteristics similar enough to those of the reference memory location that similar bit pattern probability characteristics can be assumed. For example, when the error management module 140B is unable to decode a memory location/page using hard-decision input, the error management module 140B identifies the LUT(s) associated with such location, looks up the bit patterns that occur based on multiple reads of the memory location, and provides a sequence of LLR values associated with the bit patterns in the LUT(s). In some embodiments, at least a portion of the possible LLR values generated from block 506 is selected at block 508 and applied to the decoding of the target memory location. For example, given this lookup table (LUT) of possible LLR values:

Bit Pattern LLR Value 000 V0 001 V1 011 V2 111 V3 and three reads of a target page with the following bits returned, LLR value will be selected by matching each bit pattern to the lookup table. For example, the first bit pattern is “000” (three reads of the first cell of the page) which returns the LLR value of V₁. The pattern “010” may be considered an illegal state and is mapped one of the values of the lookup table.

E0 0 0 1 0 0 . . . E1 0 0 1 1 1 E2 1 0 1 1 0 LLR V1 V0 V3 V2 V2 value

Similarly in FIG. 5B, at block 520 a sequence of LLRs based at least in part on the possible LLRs (from block 518) and on bit patterns from a multiple read operation on the target memory location is generated and data from a target memory location is decoded using the generated sequence of LLRs.

Bit Flip Count Interpolation/Extrapolation

Lower page 1-to-0 and 0-to-1 bit flip counts may be interpolated and/or extrapolated using a cumulative distribution function (CDF). FIG. 9 is a graph illustrating an embodiment of a CDF showing bit counts for two distinct reads, VT1, VT2, separated by ΔVT. In certain embodiments, the portion of the CDF between the two reads can be expressed by an approximately-liner relationship in the log domain. For example, the CDF (overlapped area) may be expressed by the following equation: ln(y)=A+B*VT. (y is, e.g., the 1-to-0 bit flip count). Therefore, based on geometry, it can get that the slope, B={ln(y1)−ln(y2)}/ΔVT. The known points, y1 or y2 can be used to derive the value of A in the above equation. For example, the following equation may be used to derive the value of A:

$\begin{matrix} {A = {{\ln(y)} - {B*V_{T}}}} \\ {= {{\ln\left( y_{2} \right)} - {{\left\{ {{\ln\left( y_{1} \right)} - {\ln\left( y_{2} \right)}} \right\}/\Delta}\; V_{T}*V_{T\; 2}}}} \\ {= {{\left\{ {{{\ln\left( y_{2} \right)}*\Delta\; V_{T}} - {{\ln\left( y_{1} \right)}*V_{T\; 2}} + {{\ln\left( y_{2} \right)}*V_{T\; 2}}} \right\}/\Delta}\; V_{T}}} \\ {= {{\left\{ {{{\ln\left( y_{2} \right)}*\left( {{\Delta\; V_{T}} + V_{T\; 2}} \right)} - {{\ln\left( y_{1} \right)}*V_{T\; 2}}} \right\}/\Delta}\; V_{T}}} \end{matrix}$

Therefore, an extrapolated flipped bit count value yx can be determined based on the line equation above.

The interpolation point yz may be derived according to similar principles. Therefore, given two or more 1-to-0 or 0-to-1 bit flip count data points, it may be possible to estimate the 1-to-0 or 0-to-1 count corresponding to an interpolated or extrapolated data point.

Upper page 1-to-0 and 0-to-1 counts may be derived the same way as above, except that each count is split into two data. For example, one count may correspond to the ‘0’ cell value for the lower page, and the other may correspond to the ‘1’ cell value for the lower page.

In certain embodiments, once the system has interpolated and/or extrapolated additional virtual voltage reading threshold values, the calibration logic may be directed to calculate LUT values in a similar way as regular LUT calibration, discussed above. Such LUTs may be used in a similar manner to those discussed above.

FIG. 10 illustrates an embodiment of a voltage distribution graph showing the separation of lower page data into different upper page distributions. Generally, 1-to-0 and 0-to-1 counts are not directly related with low/up lobe crossing. As shown, VT1 and VT3 may be used to determine 0/1 value in the upper page. The areas illustrated represent the following bit flip counts: the area 1001 represents the 0-to-1 bit flip count with lower page bit equal to ‘1’; the area 1002 represents the 1-to-0 bit flip count with lower page bit equal to ‘1’; the area 1003 represents the 1-to-0 bit flip count with lower page bit equal to ‘0’; and the area 1004 represents the 0-to-1 bit flip count with lower page bit equal to ‘0,’ each of which may be derived from reads at the various voltage read levels illustrated. The read level VT2 may be used as the lower page reference threshold. In certain embodiments, provided the correct lower page value for each memory cell, it may be possible to determine whether a given 1-to-0 or 0-to-1 bit flip happens at threshold VT1 or VT3.

Other Variations

The read levels, states, and coding schemes associated with voltage level distributions described herein, as well as variables and designations used to represent the same, are used for convenience only. As used in this application, “non-volatile solid-state memory” typically refers to solid-state memory such as, but not limited to, NAND flash. However, the systems and methods of this disclosure may also be useful in more conventional hard drives and hybrid hard drives including both solid-state and hard drive components. The solid-state storage devices (e.g., dies) may be physically divided into planes, blocks, pages, and sectors, as is known in the art. Other forms of storage (e.g., battery backed-up volatile DRAM or SRAM devices, magnetic disk drives, etc.) may additionally or alternatively be used.

Those skilled in the art will appreciate that in some embodiments, other types of data storage systems and/or data retention monitoring can be implemented. In addition, the actual steps taken in the processes shown in FIGS. 4 and 6 may differ from those shown in the figures. Depending on the embodiment, certain of the steps described above may be removed, others may be added. Accordingly, the scope of the present disclosure is intended to be defined only by reference to the appended claims.

While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of protection. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms. Furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the protection. For example, the various components illustrated in the figures may be implemented as software and/or firmware on a processor, ASIC/FPGA, or dedicated hardware. Also, the features and attributes of the specific embodiments disclosed above may be combined in different ways to form additional embodiments, all of which fall within the scope of the present disclosure. Although the present disclosure provides certain preferred embodiments and applications, other embodiments that are apparent to those of ordinary skill in the art, including embodiments which do not provide all of the features and advantages set forth herein, are also within the scope of this disclosure. Accordingly, the scope of the present disclosure is intended to be defined only by reference to the appended claims. 

What is claimed is:
 1. A method for a data storage system, the method comprising: performing a plurality of reads on the data storage system to generate a plurality of read bit streams, wherein each of the plurality of reads is performed at a respective voltage read level; for each of the plurality of read bit streams, determining flipped-bit data, based on a respective read bit stream and a reference bit stream, wherein the flipped-bit data is different from likelihood ratios (LLRs); extrapolating flipped-bit data based on the determined flipped-bit data for the plurality of read bit streams; generating reference LLR values based on the determined flipped-bit data for the plurality of read bit streams and the extrapolated flipped-bit data, wherein the reference LLR values are different from the extrapolated flipped-bit data; and decoding data in the data storage system using the reference LLR values.
 2. The method of claim 1, further comprising: determining the reference bit stream from a reference memory location.
 3. The method of claim 1, wherein the determining flipped-bit data comprises comparing the respective read bit stream and the reference bit stream.
 4. The method of claim 2, wherein the reference LLR values are associated with the reference memory location.
 5. The method of claim 2, further comprising: prior to the decoding, storing the reference LLR values, wherein the reference memory location is within the data storage system, and wherein the decoding data comprises decoding the data stored in the data storage system using the stored reference LLR values.
 6. The method of claim 1, further comprising: performing a plurality of reads of a target memory location to generate a target read bit stream; generating a sequence of LLR values for decoding the target memory location based at least in part on the reference LLR values and on the target read bit stream; and decoding target data from the target memory location using the generated sequence of LLR values.
 7. The method of claim 6, further comprising determining the reference bit stream from a reference memory location, wherein the reference memory location and the target memory location are located in a same memory block of a non-volatile memory of the data storage system.
 8. The method of claim 6, further comprising determining the reference bit stream from a reference memory location, wherein read or write cycling characteristics of the reference memory location and the target memory location are similar.
 9. A data storage system, comprising: a non-volatile memory; and a controller configured to cause: performing a plurality of reads on the data storage system to generate a plurality of read bit streams, wherein each of the plurality of reads is performed at a respective voltage read level; for each of the plurality of read bit streams, determining flipped-bit data, based on a respective read bit stream and a reference bit stream, wherein the flipped-bit data is different from likelihood ratios (LLRs); extrapolating flipped-bit data based on the determined flipped-bit data for the plurality of read bit streams; generating reference LLR values based on the determined flipped-bit data for the plurality of read bit streams and the extrapolated flipped-bit data, wherein the reference LLR values are different from the extrapolated flipped-bit data; and decoding data in the data storage system using the reference LLR values.
 10. The data storage system of claim 9, wherein the controller is configured to cause determining the reference bit stream from a reference memory location.
 11. The data storage system of claim 9, wherein the determining flipped-bit data comprises comparing the respective read bit stream and the reference bit stream.
 12. The data storage system of claim 10, wherein the reference LLR values are associated with the reference memory location.
 13. The data storage system of claim 10, wherein the controller is configured to cause, prior to the decoding, storing the reference LLR values, wherein the reference memory location is within the non-volatile memory, and wherein the decoding data comprises decoding the data stored in the non-volatile memory using the stored reference LLR values.
 14. The data storage system of claim 9, wherein the controller is configured to cause: performing a plurality of reads of a target memory location to generate a target read bit stream; generating a sequence of LLR values for decoding the target memory location based at least in part on the reference LLR values and on the target read bit stream; and decoding target data from the target memory location using the generated sequence of LLR values.
 15. The data storage system of claim 14, wherein the controller is configured to cause: determining the reference bit stream from a reference memory location, and wherein the reference memory location and the target memory location are located in a same memory block of a non-volatile memory of the data storage system.
 16. The data storage system of claim 14, wherein the controller is configured to cause: determining the reference bit stream from a reference memory location, and wherein read or write cycling characteristics of the reference memory location and the target memory location are similar.
 17. A data storage system, comprising: means for performing a plurality of reads on the data storage system to generate a plurality of read bit streams, wherein each of the plurality of reads is performed at a respective voltage read level; means for determining flipped-bit data for each of the plurality of read bit streams, based on a respective read bit stream and a reference bit stream, wherein the flipped-bit data is different from likelihood ratios (LLRs); means for extrapolating flipped-bit data based on the determined flipped-bit data for the plurality of read bit streams; means for generating reference LLR values based on the determined flipped-bit data for the plurality of read bit streams and the extrapolated flipped-bit data, wherein the reference LLR values are different from the extrapolated flipped-bit data; and means for decoding data in the data storage system using the reference LLR values.
 18. The data storage system of claim 17, further comprising: means for performing a plurality of reads of a target memory location to generate a target read bit stream; means for generating a sequence of LLR values for decoding the target memory location based at least in part on the reference LLR values and on the target read bit stream; and means for decoding target data from the target memory location using the generated sequence of LLR values.
 19. The data storage system of claim 17, further comprising: means for determining the reference bit stream from a reference memory location.
 20. The data storage system of claim 19, wherein the means for determining flipped-bit data comprises means for comparing the respective read bit stream and the reference bit stream, and wherein the reference LLR values are associated with the reference memory location. 